biclustering algorithm for embryonic tumor gene expression dataset: las algorithm

نویسندگان

hamid alavi majd biostatistics department, faculty of paramedical sciences, shahid beheshti university of medical sciences, tehran

soodeh shahsavari biostatistics department, faculty of paramedical sciences, shahid beheshti university of medical sciences, tehran

soheila khodakarim school of public health, shahid beheshti university of medical science, tehran

seyyed mohammad tabatabaei medical informatics department, faculty of paramedical sciences, shahid beheshti university of medical sciences, tehran

چکیده

an important step in considering of gene expression data is obtained groups of genes that have similarity patterns. biclustering methods was recently introduced for discovering subsets of genes that have coherent values across a subset of conditions. the las algorithm relies on a heuristic randomized search to find biclusters. in this paper, we introduce biclustering las algorithm and then apply this procedure for real value gene expression data. in this study after normalized data, las performed. 31 biclusters were  discovered that 26 of them were for positive gene expression values and others were for negative. biological validity for las procedure in biological process, in molecular function and in cellular component were 77.96% , 62.28% and 74.39% respictively. the result of biological validation of las algorithm in this study had shown las algorithm effectively convenient in discovering good biclusters.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Biclustering Algorithm for Gene Expression Data

Cheng-Church (CC) biclustering algorithm is the popular algorithm for the gene expression data mining at present. Only find one biclustering can be found at one time and the biclustering that overlap each other can hardly be found when using this algorithm. This article puts forward a modified algorithm for the gene expression data mining that uses the middle biclustering result to conduct the ...

متن کامل

A Parallel Algorithm for Gene Expressing Data Biclustering

Biclustering of the gene expressing data is an important task in bioinformatics. By clustering the gene expressing data obtained under different experimental conditions, function and regulatory elements of the gene sequence can be analyzed and recognized. A parallel biclustering algorithm for gene expressing data is presented. Based on the anti-monotones property of the quality of the data sets...

متن کامل

UniBic: Sequential row-based biclustering algorithm for analysis of gene expression data.

Biclustering algorithms, which aim to provide an effective and efficient way to analyze gene expression data by finding a group of genes with trend-preserving expression patterns under certain conditions, have been widely developed since Morgan et al. pioneered a work about partitioning a data matrix into submatrices with approximately constant values. However, the identification of general tre...

متن کامل

QUBIC: a qualitative biclustering algorithm for analyses of gene expression data

Biclustering extends the traditional clustering techniques by attempting to find (all) subgroups of genes with similar expression patterns under to-be-identified subsets of experimental conditions when applied to gene expression data. Still the real power of this clustering strategy is yet to be fully realized due to the lack of effective and efficient algorithms for reliably solving the genera...

متن کامل

A Repeated Local Search Algorithm for BiClustering of Gene Expression Data

Given a gene expression data matrix where each cell is the expression level of a gene under a certain condition, biclustering is the problem of searching for a subset of genes that coregulate and coexpress only under a subset of conditions. The traditional clustering algorithms cannot be applied for biclustering as one cannot measure the similarity between genes (or rows) and conditions (or col...

متن کامل

A Linear Time Biclustering Algorithm for Time Series Gene Expression Data

Non-supervised machine learning methods have been used in the analysis of gene expression data obtained from microarray experiments. Recently, biclustering, a non-supervised approach that performs simultaneous clustering on the row and column dimensions of the data matrix, has been shown to be remarkably effective in a variety of applications. The goal of biclustering [1] is to find subgroups o...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
journal of paramedical sciences

جلد ۴، شماره ۰۲، صفحات ۰-۰

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023